Investigating Some Imputation Methods of Multivariate Imputation Chained Equations

نویسندگان

چکیده

This paper investigates three MICE methods: Predictive Mean Matching (PMM), Quantile Regression-based Multiple Imputation (QR-basedMI) and Simple Random Sampling (SRSI) at imputation numbers 5, 15, 20 30 with 5% 20% missing values, to ascertain the one that produces imputed values best matches observed compare model fit based on AIC MSE. The results show that; QR-basedMI produced more didn’t match observed, SRSI better as number of imputations increases while PMM matched all missingness considered. for showed in terms MSE except M=15, result M= M=15 M=20 results. shows considered both where was seen produce It is concluded comparison, most suited when but QR-basedMI.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

mice: Multivariate Imputation by Chained Equations in R

The R package mice imputes incomplete multivariate data by chained equations. The software mice 1.0 appeared in the year 2000 as an S-PLUS library, and in 2001 as an R package. mice 1.0 introduced predictor selection, passive imputation and automatic pooling. This article documents mice 2.9, which extends the functionality of mice 1.0 in several ways. In mice 2.9, the analysis of imputed data i...

متن کامل

Multiple Imputation by Chained Equations in Praxis: Guidelines and Review

Multiple imputation by chained equations (MICE) is an effective tool to handle missing data an almost unavoidable problem in quantitative data analysis. However, despite the empirical and theoretical evidence supporting the use of MICE, researchers in the social sciences often resort to inferior approaches unnecessarily risking erroneous results. The complexity of the decision process when enco...

متن کامل

Effect of Reference Population Size and Imputation Methods on the Accuracy of Imputation in Pure and Mixed Populations

    Imputation as a method of creating low-density chips to high-density chips has been introduced to increase the accuracy of genomic selection in animals. In the current study, to investing imputation accuracy, three populations of mixed (scenario 1), pure (scenario 2) and mixed + pure (scenario 3) were simulated using QMSim. Two methods of imputation including Beagle and Flmpute were used fo...

متن کامل

Kernel imputation with multivariate auxiliaries

We consider a data set with missing observations but known auxiliaries for the sample and develop a real donor imputation. For each unit with missing observations we construct a distribution over a set of possible donors. We want the expectation (or distribution) to be chosen so that the expectation (or distribution) of the imputed values should equal the distribution of the units’ true values....

متن کامل

Influence of Outliers on Some Multiple Imputation Methods

In the field of data quality, imputation is the most used method for handling missing data. The performance of imputation techniques is influenced by various factors, especially when data represent only a sample of population, for example the survey design characteristics. In this paper, we compare the results of different multiple imputation methods in terms of final estimates when outliers oc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: European Journal of Mathematics and Statistics

سال: 2022

ISSN: ['2736-5484']

DOI: https://doi.org/10.24018/ejmath.2022.3.3.109